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SYSTEM AND METHOD FOR ENHANCING 
OPERATION OF A WEB SERVER CLUSTER 

Field of the Invention 

The invention relates to the field of digital data packet management. More 
5 specifically, the invention relates to the regulating of data flow between a client 
computer and a cluster or group of data servers. 

Background of the Invention 

The evolution over the past twenty years of digital communications 
10 technology have resulted in a mass deployment of distributed client-server data 
networks, the most well known of which is the Internet. In these distributed client- 
server networks, clients are able to access and share data or content stored on servers 
located at various points or nodes on the given network. In the case of the Internet, 
which spans the entire planet, a client computer is able to access data stored on 
15 servers located anywhere on the Earth. 

With the rapid proliferation of distributed data networks such as the Internet, 
an ever-increasing number of clients firom around the world are attempting to connect 
to and access data stored on a finite number of servei s. For example, web site dwneii ' 
and/or operators deploying and maintaining servers containing web pages fi-om their 

20 popular web sites are finding it increasingly difficult to ensure that all requests for 
data and/or access can be satisfied. Each server can support only a finite number of 
concurrent client connections based on the server's computational, storage and 
commiinications capacity. When the number of client requests for content or data 
(i.e., connection requests) exceeds the server's capacity, the clients' connection 

25 requests are generally refiised or dropped shortly after establishing connections, often 
in midstream of receiving the requested content. In extreme cases, the number of 
client requests for content may overload or overwhelm the server as to effectively 
disable the server, i.e., knock the server out of commission. 
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As a partial solution to this problem, the web site owners and/or operators 
typically deploy multiple mirrored servers, each server having identical content. The 
mirrored servers are usually connected to the same local area network and are 
collectively referred herein as a server cluster. In conjunction with the multiple 
5 mirrored servers, the web site owners and/or operators also employ a load balancer to 
distribute the load among the mirrored servers. That is, when a client request a 
connection to one of the servers in the server cluster, the cluster's load balancer 
processes the request to evenly spread the load (i.e., coimection requests) among the 
servers in the server cluster. Based on information regarding the condition of each 
10 server in the server cluster, the load balancer facilitates a connection between the 
client and a server that is capable of handling the client*s request. 

An inherent drawback of this load balancing approach of the prior art is that 
they all utilize a central load balancer. Whether the load balancer is a dedicated 
hardware appliance or a general-purpose computer running load balancing software, 

15 all of the prior art solutions require that a client's connection request be first received 
and processed by a load balancer before the request can be directed to a server. 
Accordingly, the maximum rate at which the entire server cluster can receive and 
respond to client requests is limited by the throughput of the load balancer. Hence, if 
the load balancer's capacity is exceeded, the requests can be ignored or dropped even 

20 if the server cluster has sufficient capacity to process the requests. Another inherent 
drawback of the prior art centralized load balancing system is that the entire server 
cluster can be rendered inoperative if the central load balancer fails. 

AppHcant's pending patent application Serial No. 09/565,259, filed May 5, 
2000, describes a distributed load balancing solution for homogeneous server clusters, 
25 which overcomes the above mentioned drawbacks of the prior art, which is 
incorporated herein in its entirety. In homogeneous server clusters, the member 
servers are interchangeable and each server contains substantially the identical content 
(e.g.,*.HTMLor*.CGI). 

An ever-increasing demand by Internet users for diverse content has prompted 
30 Internet operators (i.e., web sites, ISP's and ASP's) to deploy heterogeneous server 
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clusters composed of servers having different data types. Heterogeneous server 
clusters, also known as asymmetric clusters, are composed of multiple server groups, 
where each group contains at least one server and all the servers in a group contain 
substantially identical content. That is, each group of servers in a cluster stores 
5 different content. Heterogeneous server clusters are particularly useful for storing 
content in a number of different content formats, such as HTML, CGI, streaming 
audio or video, etc. Since each content format has different storage and transmission 
characteristics and requirements, it is inefBcient for web site owners and/or operators 
to employ a single server to provide data in various different formats to clients. When 

10 diverse content in a variety of data formats is required, it is desirable to divide the 
server cluster into groups of servers, where each group of servers processes content 
requests for a limited number of data format, such as one or two particular data 
formats. For example, a commercial web site having content in numerous formats 
may divide the server cluster into three groups of servers: the first group providing 

15 only HTML content, the second group providing only CGI content, and the third 
group providing only streaming audio and video content. 

Content must be updated in real-time on many of today's commercial web 
sites, and with the increasing complexity and number of servers in the server clusters 
used by these sites, the prior art load balancing system often direct a client's request 

20 to a server where the Teqiiestied cbilterit is'either beirig updated or is' stale. Althbiigli 
some of the prior art load balancing system consider the format or type of content 
being requested, none of the prior art load balancing system can detect or determine 
which servers contain the most recent version of the content, and which servers 
contain stale data and require updating. Therefore, although the prior art load 

25 balancing system can direct a chent's request to the appropriate server group, none of 
the prior art load balancing system can assure that the client is being directed to a 
server with the most recent version of the requested content. 

Objects and Simmiary of the Invention 

Therefore, it is an object of the present invention to overcome the 
30 disadvantages of the above-described load balancing system by providing a 
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distributed system and method for balancing client connection load among the servers 
of a heterogeneous server cluster. 

Another object of the present invention is to provide a system and method of 
directing a chent's request for data to a server having the latest version of th e 
requested conte nt. 

A further object of the present invention is to provide a content updating and Qf^ 
distribution system and method which works collaboratively with the distributed load a 'J^^ 
balancing system of the present invention- ^ -^J^Ui*< Z 

The present invention is a computer network load balancing and content*^ 
distribution system, which is highly scalable and opthnizes packet th roughput by I ^^cM^^^^ 
dynamicall y distributing client connections among appropriate servers in a serve r \ 
clustCT . 

In accordance with an embodiment, the present invention includes a server 
cluster having a plurality of server groups, where each group has at least one server. 
15 All servers in the cluster have a common network address, and are connected to a v 
network such that each server receives a chent's connection request at substantially^^ i^V^ 




Q^x^ the same time. [Each server has a load balancing module which generates ^ 

I connection value^ for each connection reques t received, b y the server jf A particiila T 
^•5^ -^,servsf in the server cluster accent^ and pmce5;<;RR the network conniSon request 
r^r 20 based on the computed connection value of the reque^ That is, t he cluster has rang e 
of c onnection values and each server is associated with a non-overlapping sub-rang e 
of connection values associated with the cluster and accepts only connection requests 
having connection values within its associated sub-range. Each server^s sub-ranpe is 
dy naniically adjusted based on its available capacity, v^ere the size of a server's su b- 
25 range^elative to the entire ran^^ }<i appmyiniatel v proportional to the <;R^ Pr>Q 
avglable capacity relati ve to the entire cluster's available capacity . The load 
balancmg modules on each of the server in the cluster communicate information 
relating to their server's available capacity to each other. 

[^Upon estabUshing an initial connection with a client, a server according to the Q^J^ [ <f 
30 present invention includes a dding module for reading the client's request in nrde^ t^j ^ (yi^^ 
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determine whether it has the requested contentj ^K^ the requested content does not 
reside on the accepting server or a more recent version of the content can be found on 
another server in the cluster having sufficient available capacity to accept a 
connection from the chentl^e accenting server rfiHi^erifi^<^ client corm^on request 
to that other server, which is referred to herein as a destination serv3T^ if 

reques t and transmits the requested content to the client. \ ^ — 

In accordance with another embodiment, the distributed load balancing system 
of the present invention supports persistent sessions using cookies and/or secure 
sockets layer (SSL) identification tags. ^The l oad balancing module of the present 

1 i nvention recognizes cookies and SSL identification tags, and directs the connections 
t o the appropriate server or group of servers based on those recognized cookies and ^ 
SSLjtags. I 

Working in conjunction with the load balancing system, a content distribution 
system of the present invention distributes and updates content to servers in a server 
cluster. The content distribution system includes a storage area for storing the content *1 
to be distribute d, a file transfer module for copying the content to servers in the — ^ 
cluster, and data tables for storing information regarding the freshness (e.g., version 
number, last edit or updated date, etc.) and availability of content stored on each 
server in the cluster^ 

Various other objects, advantages, and features of this invention will become 
readily apparent from the ensuing detailed description and the appended claims. 




Brief Description of the Drawings 

The following detailed description, given by way of example, and not intended 
to hmit the present invention solely thereto, will best be understood in conjunction 
with the accompanying drawings: 

Fig. 1 is a diagram illustrating a heterogeneous server cluster in accordance 
with an embodiment of the present invention; 
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Fig. 2 is a diagram illustrating a client computer establishing a connection 
with a server in the server cluster in accordance with an embodiment of the present 
invention; 

Fig. 3 is a diagram illustrating a client connection being redirected from one 
5 server to another in the server cluster in accordance with an embodiment of the 
present invention; 

Fig. 4 is a diagram showing an example of a data flow when a client 
connection is redirected from a first server to a second server. 

Fig. 5 is a diagram showing range and sub-range values for servers within a 
10 server cluster in accordance with an embodiment of the present invention; and 

Fig. 6 is a diagram showing a content distribution system in accordance with 
an embodiment of the present invention. 



Detailed Description of the Invention 

15 The present invention is readily implemented using presently available 

commxmication apparatuses and electronic components.- The invention finds ready 
application in a private or public communications network utilizing a heterogeneous 
. server cluster. ,Jt is appreciated that the communications network can represent the 
Internet, a computer network, wireless network, a satellite network, a cable network 

20 or any other form of network capable of transporting data locally or globally. 

Server Cluster Configuration 

Turning now to Fig. 1, there is illustrated an example of a heterogeneous 
server cluster 100 comprising: a first group of servers 110 containing *.cgi content, 
such as servers 10a and 10b; a second group of servers 120 containing *.html content, 
25 such as servers 10b to lOd; and a third group of servers 130 for processing cookie 
sessions, such as servers lOe and lOf. All the servers 10 are connected to a common 
router 30. Although not shown in Fig. I^Vhr mnt q- '^0 rp^p ;ivpg m mhnnyn^ nMmt 
request and m ulticasts thejreceived request to all the servers 1 0 in tb^ /-incfr^ ] Oft J As 
exemplified by the server 10b, the same server may belong to more than one group 
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within the clxiste r. Whether a server belongs to a particular group is determined by 
the content stored on that server. Server 10b belongs to both *.cgi group 110 and 
*.html group 120 because it contains content in both *.cgi and *.html formats. 
Whereas other servers containing content in only a single format belong to only one 
5 of the three groups in the cluster 1 00. 

Although Fig. 1 shows o nly one router 30, its is appreciated that multiple 
routers can be used in a cascading and p arti ally overl appin g roT^ fig^iratj on as shovm in 
^plicant^s p rior patent application, Serial No. 09/565,259, which is incoiporated 
herein in its entirety. 

10 Establishing an Initial Connection 

^ Turning now to Fig. 2, there is illustrated an ex aniplft pf a rMp^ f c omputer 60 
establishing a connection with a load balanced server cluste r in arrnrdanrft with an 
embodiment of the present invention. The load h^^^r^^mp fAofiTiiqit^s disclnsftd tn 



applicants' pending patent application Serial No OQ/SfiS 7SQ I'g ng/>f^Jr>loi»H balance 



15 t he client computer's 60 initial connec tions to the heterogeneous s erver cluster of th e 
present mvention. initiation, the router 30 multicasts or broadcasts an address <J 
resolution protocol ("ARP") packet to all the servers in the cluster 100. The ARP 
packet is used to dynamically bind the virtual IP address 2.2.2.2 of the cluster 100 to 
the real IP addresses of the servers 10 in the cluster 100. In response to the ARP 
20 packet, the servers 10 respond with a special multicast address, such as 
01 :00:5E:75:C9;3E/IP 224.1 17.201.62, and not their real MAC (media access control 
or hardware ethemet) address to the router 30. The router 30 stores the real IP 
addresses of the servers 10 in its ARP cache, and all incoming packets addressed to 
the virtual IP address 2.2.2.2 are thereafter multicast to the corresponding real IP 
25 addresses of the servers 10. 

^In accordance with an embodiment of the present invention, each server 10 ^ \ 

includes a receiving module 210 for receiving a request and a load balancing module ^ ^ 

12 for evaluating or detennining whether to pass the client request received by the 
server to the server's TCP/IP stack. Upon receipt of a client request by the receiving 
30 modules of the servers 10, only one of the load balancing modules 12 residing in the 
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10 



30 



servers 10 passes the client request to its TCP/IP stack, thereby insuring that the ^yl^yJ^. 
r equesting client establishes connection with only one server 10 the cluster 
That is, the load balancing modules 12 residing in the other servers 10 in the cluster 

fl 




100 discard the chent request^ a accordance withan aspect of the present invention, 
each load balancing module 12 evaluates a cliCTt request by y signing the client 1 
request a connection valu^The connection value is a snhRtantiany rj a ^dom numbe r 
h aving an equal probabiUty of being anywhere Avithin a fixed range, e.g., 0 to 32,000.^ 
For example, the loading module 12 can generate the connection value usingf^a 
hashing function on a predefined portion of the data packet comprising the request 
Since each load balancing module 12 performs the same hashing function on a given 
request, the same connection value is generated by all the load balancing modules 12 
for each request. 

A load bala n cing module 1 2 perrnjtc ifo /^r^iy^yp^nding f rn^nr tn n rrq>t rli r p t 
reguests ^.e., establish a connection or pass the rfiqnests tn thp TPp/TP stank;^ h^^^'^ g 
certain connection values. For example, as shown in Fi f y % tluR l^g id balancing^ 
module 12b residing in the server 10b accepts onlv requests having connection values 
fro m 10,001 to 20,000. If the client's request has a connection value of 22,000 , then 
the load balancing module 12c pa sses the RVN packet associated with the clie nt's 
request to the TCP/IP stack of the server 10 c. The synchronizing segment (SYN) is 
the first segment sent by the TCP protocol and is used to synchronize the two ends of 
a connection in preparation for opening a connection. Whereas the load balancing 
modules 12a and 12b discard the SYN packet because the connection value is outside 
their acceptable range of connection values^^^ 

Each server is assign^ a range of connection values as a function of its 
available capacity in relation to the overall available capacity of the cluster. That is, a 
SCTver havi ng a great er capacity to accept new requests for connection is assigned a 
g eater number or range of connection values. In accordance with an embodiment of 
the present invention, each server 10 includes an agent 14 that intermittently 
broadcasts infonnation regarding the available capacity or connection availability of 
its associated server to other servers 10 in the cluster 100. Preferably, each server 
stores the available capacity information of other servers in the cluster 100. A server's 

8 
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connection availability is directly proportional to its overall available capacity and 
inversely proportional to its current connection load. In other wnrris^ the rai ^fye of 
each server's assigned connection value is substantially jj roportional to the server's 
connection availabihty relative to the overall connection availability of the cluster 
100. For example if a server 10a has thirty percent (30%) of the connection 
availability of the cluster 100, then thirty percent (30%) of the cluster's connection 
values will be assigned to the server 10a. Preferably, each server' s assigned range of 
the connection values is co ntinuously updated as a function of its available capacity or 
connection availability, which may change over time. 

10 2i server becomes inoperative or disabled, the connection value s of the 

disabled s eryer is assigned to the remaining servers in the cju ster 100. For example, if 
server 10a is disabled and each remaining server now has fifty percent (50%) of the 
available capacity, then the servers 10b and 10c are now respectively assigned 
connection values from 0 to 16,000 and 16,001 to 32,0()o!Ja1so, during a transition 
15 period wherein the servers are assigned new range of connection values, a server's 
range of connection values may temporarily overlap with ano ^^^r g^^^f^^'g n*"f^^i ^ ^ ■ 
a co nnection value may be assigned to more than one serye r. In such a scenario, the 
Connection request may be accepted by two servers, but only one connection will be 
generally established since most conventional network protocols have mechanism to 
20 ]j' resolve such con^ example; under the TCP/IP protocol, if two iservere acfcep ' 

client's connection request and respond by transmitting their own SYN 
acknowledgement (ACK) packets to the cUent computer 60, the client computer 60 
will only accept one SYN ACK packet and reject the other, thereby establishing a 
connection with only one server. 

25 Redirecting Connection 

Turning now to Fig. 3, there is illustrated an example of a chent connection 
being redirected from one server to another in the server cluster in accordance with an 
embodiment of the present invention. In Fig. 3, the server lOe (referred to herein as 
the original server) redirects a cUent's 60 connection request to a second server 10a 
30 (referred to herein as the destination server). After a connection is estabhshed 
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between the client 60 and the server lOe, the cUent 60 sends several data packets . 
typically known as PUSHQ packets, to the server lOe .jpThe PUSHQ packets 
collectively form a header which identifies the requested content of thp r^ i ^if f.n The 
server lOe includes a readmg module 220 (Fig. 2) for reading the header (i.e., the 
PUSH 0 packets) and determining whether its storage device (not shown) has the 
r equested ^on tent Jpor example, if it is determined that the requested content 
Mailable firom the server lOe, the load balancing module 12e on the server lOe 
permits the server lOe to transmit ttie requested content to the client 60."^ 

However, if the requested content is a CGI script and resides only in the *.cgi 
^Pj^group 110 (the servers lOa and 10b)^The load balancing module 12ejelfi£tsa serv( 
in the *.cgi group 110 based on its stored available capacity information of oth 
servers in flie cluster 1 10, particularly servers 10a and 10b!^For example, if the servg 



I ra 





10a has greater available capacity than the server 10b, then the server lOe redirects 
the client's connection to server lOa using a TCP/IP connection protocol, UDP 
15 y^tocol, or other comparable IP level protocol.^^ 

"^Altematively, the original server may redirect the client request to the"7 
destination server to maintain a persistent session with a particular server. Thel^J^M 
destination server can be identified using cookies and SSL tags. It is appreciated that 



20 



25 



this can be used to limit a client access to particular servers or to maintain data 
integrity by allowing the client to access the content or data fi-om the same content 
source, i.e., firom the same server. - 

A technique of redirecting a connection firom one server to another in 
accordance with an embodiment of t he present invention is shown in Fie . 4. The load 
balancing modules of the original and destination servers process or control the tasks 
involved in redirecting the connection. The redirection process is described in 
conjunction with the Figs. 3 and 4. The load module 12e initially accepts the cUent 
request and establishes a connection with the chent 60. If the load module 12e 
determines that another load module within the cluster 100, such as the load module 
12a, is better suited to provide the requested content, then the load module 12 
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t ransmits the c lient's connection information to A e loa d module 12a and ten niaates 
its connection with the client 60. 

In other words, if the load balancing module 12e determines that another 
server in the cluster 100, such as the destination server 10a, should continue with the 
5 established connection or conversation, the load balancing module 12e transmits the 
information indicative of the client's connection, such as the PUSHQ data packet, the 
source IP, the source port, and a sequence number of the SYN packet, to the 
destination server 10a. The load balancing module 12a of the destination server 10a 
uses the packets received fh>m the original server lOe to alter the state of its TCP/IP 
10 stack, thereby replicating the state of server lOe. 

More specifically, the load balancing module 12a uses the information 
received &om server lOe to generate a SYN packet having a source IP, source port 
and SYN sequence number identical to the SYN packet originally received by the 
server lOe. In accordance with an embodiment of the present invention, the newly 

15 generated SYN packet appears to the server lOa as if it originated from the client 60 
and is passed or injected into the TCP/IP stack of the server 10a. The TCP/IP stack 
attempts to reply with a SYN/ACK packet, but the load balancing module 12a 
intercepts and discards the SYN/ACK packet. Consequently, the supplied data 
packets (PUSH) are injected into the TCP/IP stack of the destination server 10a and 

20 the destination server 10a is effectively brought into synch with the original server 
lOe, with respect to the connection with the client 60. Once the connection is 
successfully redirected and the destination server 10a is in synch with the original 
server lOe, the original server lOe tenninates its connection with the client 60. In 
accordance with an aspect of the present invention, the load module 12e can push or 

25 inject a FINQ packet into the TCP/IP stack of the server lOe to terminate the 
connection between the server 12e and the client 60. In response to the FINQ packet, 
the TCP/IP stack generates and transmits a FIN/ACK reply, which is intercepted and 
discarded by the load balancing module 12e: 

Choosing a Destination Server 
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Turning now to Fig. 5, there is illustrated a technique for detennining the 
destination server to redirect the client's connection by the original server in 
accordance with an embodiment of the present invention. The original server that has 
accepted and established a connection with a client 60 may determine for one of 
several reasons that another server in the cluster 100 is better suited to handle the 
Ghent's request. For example, the original server may redirect a client's connection if 
it does not have the requested content or the latest version of the requested content If 
th e client 60 reque sts CGI content, the o riginal server lOe of Fi g. 5 belonffln&_tp the 
c ookie server group 130 will hkely redirect the client' s connecti on sincejtj dpes not 
have jhe requested contgi T^e. T herefore, the load balancing module 12e must 
\^ d etermine or evaluate which other server 10 in the cluster 100 can provi de the 
requested content to the client 6 0. Each load balancing module 12 includes a record 
or information regardingthe data format(s) of all the server groups in the cluster 100. 
Accordingly, the load balancing module 12e utilizes its stored data format iTifnrm:^tif>T) 
15 to determine that servers 10a and 10b are likely to contain the requested CGI content. 
C hice the orig i nal server lOe determines which server p ron p to redirect the clien t's 
c onnection, the original server lOe selects a particular server within that grou p based 
on cert^n parameters, such as the available capacity of the servers, etc. According to 
an aspect of t he present invention,, the original server lOe redirects the client's 
20 connection to a server having the highest available capacity in the appropriate 
destination server group. 

In accordance with an embodiment of the present invention, the original server 
multicasts a redirection packet to each server in the destination group. Each server in 
the group is assigned another range of connection values as a function of its available 
25 capacity in relation to the overall available capacity of the group. T hat is, each serve r 
is assigned a range of connection values based on its available capacitv in relatio n to 
the overall available capacity of the clust er (i.e., at the cluster level) and another range 
based on its available capacity in relation to the overall capacity of the group (i.e., at 
the group level). As illustrated in Fig. 5, the server 1 Of belonging to the cookie group 
30 130 has connection values 25,001 to 32,000 with respect to the cluster 100 and 15,001 
to 32,000 with respect to the cookie group 130. Also, a server belonging to multiple 

12 
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groups has a multiple range of comection values at the group level. For example, in 
Fig. 5, the server 10b belonging to both the *.cgi group 1 10 and the *.html group 120 



/ 15,001 to 32,000 for the *.cgi group and 0 to 10,000 for the *.html group. Upon 
5 receiving the redirection packet, each server in the destination group performs an 
i dentical hashin g function on a portion of the redirection packet, such a s the header, to 
generate a second connection v alue. The server in the destination group that is 
assigied^the second connection value accepts the redirection packet and establishes a 
connection with the client 60. 

10 In accordance with another embodiment of the present invention, the original 

server 10 utilizes a hashing function to select the appropriate server in the destination 
server group. For example, the original server maintains a group level table 
^ containing the range of connection values that are assigned to each server injhe 



es tination group . That is, the original server performs a second hashing function to 
15 generate a second connection value, and redirects the connection to the server in the 
destination group that is assigned the second connection value. 

Content Distribution and Availability 



. connected to the server cluster 100 via the router 30 in accordance ..with- an. 

20 embodiment of the present invention. The content distribution system 40 includes a 
storage area 42 for storing content to be distributed to the servers 10 and a File 
T ransfer P rotocol rTTP"'^ module44fhr transporting a copy of the stored conte nt 
from the storage area 42 to each server 10 in the cluster 100 via the router 30 . The 
content distribution system 40 also includes an update table 46 for storing records that 

25 indicate the status of each content distributed to each server 10 in the cluster 100. 

During the file transfer process, i.e., when the FTP module 44 copies (or 
updates) a particular content from the storage area 42 to one or more servers 10 in the 
cluster 100, the content distribution system 40 changes the corresponding records in 
the update table 46 to indicate that the co nte nt being updated is currently 
30 "unavailable" on those servers. Accordingly, for example, when a load balancing 




has two range or sets of connection values at the group level, connection values 




Turning now to Fig. 6, there is illustrated a content distribution system 40 
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module 12e of the server lOe (Figs. 3 and 5) selects an appropriate destination server 
for redirecting a connection request for specific content, the load balancing mo dule 
1 2e examines the update table 46 to de termine if the requested content is 
" unavailable" on any server and disregards or ignores all such servers m its selection 
-iroc553. Preferably, the content distribution system 40 updates only a subset of the 
servers in a server group at any given time, thereby always providing at least one 
server fix>m each group to process clients* requests even if the requested content is 
currently being updated by the FTP module 44. Once a predetemined or threshold 
number of servers are updated with a new version of the content, the content 
distribution system 40 modifies the corresponding records in the update table 46 to 
indicate that the servers containing the old version of the content arc "unavailable ". It 
i s appreciated that the thre shol d number can be any value fi-om 5% to 95% of the total 
number of servers being updated. 

Each time a particular content is copied to a specific server by the FTP module 
44, the content distribution system 40 modifies the corresponding record to indicate 
the status change of that particular content wdth respect to that specific server. In 
accordance with an embodiment of the present invention, the record is changed to 
indicate that the content is now "available." Preferably, the record also indicates the 
"fireshness," the date and time of the update, or the current version of the content, 
thereby enabling the load balancing module 12 to distinguish between servers having 
older and newer versions of the same content. It is appreciated that a record for a 
specific piece of content on a specific server can indicate the time and date the content 
was last updated or it can indicate a version value for that content. The standard 
convention is to assign a higher version value to the latest or newer version of the 
content. Therefore, a load balancing module 12 uses the update table 46 to select an 
original server or a destination server (for redirecting a client's connection) with the 
latest version of the content, i.e., a server corresponding to a record with the highest 
version value for said content. 

While the present invention has been particularly described with respect to the 
illustrated embodiment, it will be appreciated that various alterations, modificati ons 
and adapta tions mav be made on the present disclosure, and are intended to be within , 

14 
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the^scoge ^of the present inventioD. It is intended that the appended claims be 
interpreted as including the embodiment discussed above, those various alternatives, 
which have been described, and all equivalents thereto. 
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What is claimed: 

1. A method for balancing connection load among servers in a heterogeneous 
server cluster, comprising the steps of: 

determining by each server in said cluster whether a connection request having 
at least information regarding a requested content can be accepted; 

accepting said request by a server if it is determined that said server can accept 
said request; 

reading said request to detennine if said requested content resides in said 

server; 

redirecting said request to another server if it is determined that said content 
does not reside in said server. 

2. The method of claim 1, fiirther comprising the step of assigning a non- 
ovCTlapping range of connection values from a plurality of connection values to each 
server in said cluster , said plurahty of connection values heitiy. associated with said 
^jHE^i and wherein the step of detcmiinin^ include5: the ct^ of gcnemtinga 
connection value for said request; and wherein the step of a c cepting includes the stg i 
of selecting a server associated with said coi mection value. u 

3. The method of claim 2, wherein the step of assigning includes the steps of: 
determining an available capacity of each server in said cluster and an overall 

availabl e capaci ty of said cluster, 

detenmining a proportional available capacity of said each server widi respect 
to said overall available capacity; and 

assigning a range of connection values to said each server in accordance with 
said proportional available edacity of said each server. 

4. The method of claim 2, wherein said connection value is a substantially 
random number selected from said plurality of connection values. 

5. The method of claim 2, wherein said request comprises at least^^one_Jata 
^gadcgt; and wherein the step of generating performs a hashing function on a 
predefined portion of said data packet to generate said connection value for said 
request. 
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6. The method of claim (, further comprising the step of grouping said servers in 
said cluster in accordance with stored content format of said servers to fonn one or 
more groups, wherein each server in a group contains substantially identical content. 

7. The method of claim 6. wherein the step of redirecting includes the step of 
selecting a destination group in said cluster in accordance with said requested content. 

8. The method of claim 7, wherein the step of redirecting includes the steps of: 
assigning a non-overlapping range of group connection values fio m a pluiahtv 

of group connection values to each server i n said destination 

generating a group connection value for said request; and 
selecting a destination server in said destination group associated with said 
group connection value. 

9. The method of claim 8. wherein the step of assigning group connection values 
includes the steps of: 

determining an available capacity of each server in said group and an overall 
15 available edacity of said destination group; 

determining a proportional available capacity of said each server in said group 
with respect to said overall available capacity of said destination group; and 

assigning a range of group connection values to said each server in said group 
in accordance with said proportional available capacity of said each server in said 
20 destination group. 

10. The method of claim 7, further comprising the step of storing records in an 
update table, each record having at least a version value of each content residing in 
each server in said cluster. 

11. The method of claim 1 0, wherein the step of redirecting includes the step of: 
reading records corresponding to said requested content for each server in said 

destination group; and 

selecting said destination server in said destination group with the highest 
version value for said requested content. 

12. The method of claim 10, wherein said record further includes availabiUty 
information of said requested content; and wherein the step of selecting includes the 
steps of determining if said requested content is unavailable from any server in said 
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destination group to provide unavailable servers and inhibiting the selection of said 
unavailable servers as said destination server. 

13. A distributed system for balancing connection load among servers in a 
heterogeneous server cluster, comprising: 

a plurality of servers, each server comprising: 

a receiving module for receiving a connection request from a client, 
each request having at least information regarding a requested content; 

a load balancing module for determining whether said request can be 
accepted by said server and designating said server as a first server if it is determined 
that said server can accept said request; 

a reading module for reading said request to determine if said 
requested content resides in said first server; and 

wherein said load balancing module of said first server is operable to redirect 
said request to a second server in said cluster if it is determined that said content does 
not reside in said first server. 

14. The system of claim 13, wherein said cluster being associated with a plurality 
of connection values; wherein each server being assigned a non-overlapping range of 
connection values fi^om said plurality of connection values; and wherein said load 
balancing modules are operable to generate a connection value for said request to 
determine which server is associated with said connection value to detennine said first 
server. 

15. The system of claim 14, wherein each server includes an agent for determining 
an available capacity of said server, broadcasting said available capacity to said 
plurality of servers in said cluster, and detemuning an overall available capacity of 
said cluster; and wherein said range of connection values being assigned to a server as 
a fimction of said available capacity of said server and said overall available capacity 
of said cluster. 

16. The system of claim 14, wherein said connection value is a substantially 
random number selected firom said plurality of connection values. 

17. The system of claim 14, wherein said request comprises at least one data 
packet; and wherein said load balancing modules are operable to perform a hashing 
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function on a predefined portion of said data packet to generate said connection value 
for said request. 

18. The system of claim 13, wherein said servers in said cluster are grouped in 
accordance with stored content format of said servers to form one or more groups, 

5 wherein each server in a group contains substantially idoitical content 

19. The system of claim 18, wherein said load balancing module of said first 
server is operable to select a destination group in said cluster in accordance with said 
requested content. 

20. The system of claim 19, wherein said destination group being associated with 
) a plurality of groiqj connection values; wherein each server in said destination group 

being assigned a non-overlapping range of connection values fiom said plurality of 
connection values; and wherein said load balancing modules are operable to generate 
a group connection value for said request to determine which server in said 
destination group is associated with said group connection value to determine said 
second server. 

21 . The system of claim 20, wherein each server in said destination group includes 
an agent for determining an available edacity of said server, broadcasting said 
available capacity to said servers in said destination group, and determining an overall 
available capacity of said destination group; and wherein said range of group 
connection values being assigned to a server in said destination group as a fimction of 
said available capacity of said server and said overall available edacity of said 
destination group. 

22. The system of claim 1 9, fiulher comprising an update table for storing records, 
each record having at least a version value of each content residing in each server in 
said cluster. 

23. The system of claim 22, wherein said load balancing module of said first 
server is operable to read records corresponding to said requested content for each 
server in said destination group fiiom said update table and select said second server in 
said destination group with the highest version value for said requested content. 

24. The system of claim 22, wherein said record fiuther includes availability 
information of said requested content; and wherein said load balancing module of said 
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first server is operable to determine if said requested content is unavailable fiom any 
server in said destination group to provide unavailable servers and to inhibit the 
selection of said unavailable servers as said second server. 

25. A method for balancing connection load among servers in a heterogeneous 
server cluster, comprising the steps of: 

determining by each server in said cluster whether a connection request having 
at least information regarding a requested content can be accepted; 

accepting said request by a server if it is determined that said server can accept 
said request; 

reading said request to determine if a latest version of said requested content 
resides in said server; 

redirecting said request to another server if it is determined that the latest 
version of said content does not reside in said server. 

26, A distributed system for balancing connection load among servers in a 
heterogeneous server cluster, comprising: 

a plurality of servers, each server comprising: 

a receiving module for receiving a coimection request fh)m a client, 
each request having at least information regarding a requested content; 

a load balancing module for determining whether said request can be 
accepted by said server and designating said server as a first server if it is determined 
that said server can accept said request; 

a reading module for reading said request to determine if a latest 
version of said requested content resides in said first server; and 

wherein said load balancing module of said first server is operable to redirect 
said request to a second server in said cluster if it is determined that the latest version 
of said content does not reside in said first server. 
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